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PSEUDO-DYNAMIC LATCH DERACER 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

[1] The invention relates to integrated circuits and in particular to logic timing 
solutions for a pseudo-dynamic latch. 

2. Background of the Related Art 

[2] One of problem of domino logic circuits is they cannot implement inverting 
logic (e.g., NAND, NOR, and the like). Accordingly, to implement inverting logic with 
domino logic circuits, pseudo-dynamic latches can be used to implement inverting logic 
without additional stage delays, as is known in the art. 

[3] A pseudo-dynamic latch is simply a pseudo-dynamic circuit with a latch 
function, as illustrated in Figure 1. The pseudo-dynamic latch is known in the art and 
therefore a detailed description of the circuit will not be presented. 

[4] Referring to Figure 1, select signals SI and S2 are domino signals, and some of 
the data inputs are domino, and some are domino-precharge (or buffered domino- 
precharge), e.g., data signal Dl, as will be appreciated by those skilled in the art. The selects 
SI and S2 are mutually exclusive, and can arrive relatively earlier than the data signals Dl and 
D2. When buffered inverted clock signal NCKB is high, it is in the evaluation phase, and 
when NCKB goes low, it is in hold state just like a conventional latch. However, a special 
case occurs when none of the select signals SI and S2 are on. In this case, the circuit may be 
in the evaluation phase as determined by the status of NCKB, but both the pull-up 130 and 



pull-down paths 140 and 150 will be off due to the selects being off. Therefore, the circuit is 
in functionally in the hold mode. 

[5] As the operating frequency and complexity of logic circuits increase, the 
potential for timing errors increase and margins decrease. Modern circuit design includes 
complex modeling of signals with in a circuit and to determine potential errors due to timing 
problems, such as min-delay, max-delay, and the like. Accordingly, reliable timing is 
important for reliable operation of logic circuits. 

[6] For example, assume a circuit design showed a speed path in one of the 
domino-precharge signals (e.g., data signal Dl). Initially, the domino-precharge signal from 
domino driver 102 had a buffer 110 used as the deracer. In order to solve the speed path 
problem, the buffer 110 could be removed. However, subsequent analysis of the design 
then indicated a violation in the min-delay (i.e., the minimum delay for consistent data 
latching). An illustration of this problem is provided in the following section in relation to 
Figure 2. 

[7] Referring to Figure 2, assume the node PD is high during the evaluation 
phase. If signal NCKB does not go low by the time the precharge value of data signal Dl 
arrives (e.g., time 206), then a min-delay situation will occur. As discussed above select 
signal SI goes high at time 202, before data signal Dl arrives at time 204. However, since at 
time 208 precharge data signal Dl is high, select SI is still high, and NCKB has not gone low 
due to the skew 201, the new data held 210 is the wrong value. That is at time 208 the 
wrong value of PD will be latched. In this example, the min-delay was mosdy due to the 
high skew 201 between data signal Dl and the buffered inverted clock signal NCKB. Those 



skilled in the art will appreciate that the skew can be due to a variety of sources, such as the 
effects of process variation on the clock, and the like. Therefore, as illustrated in Figure 2, 
the related art circuit does not provide for reliable operation. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[8] The invention will be described in detail with reference to the following 
drawings in which like reference numerals refer to like elements wherein: 

[9] Figure 1 illustrates a schematic diagram of a pseudo-dynamic latch in 
accordance with the related art; 

[10] Figure 2 illustrates a timing diagram of signals of the pseudo-dynamic latch in 
accordance with the related art; 

[11] Figures 3A-C illustrate schematic diagrams of a pseudo-dynamic latch in 
accordance with embodiments of the present invention; 

[12] Figures 4A and 4B illustrate timing diagram of signals of the pseudo-dynamic 
latch in accordance with embodiments of the present invention; 

[13] Figure 5 illustrates a schematic diagram of a pseudo-dynamic latch in 
accordance with another embodiment of the present invention; 

[14] Figure 6 illustrates a system level block diagram of a computer system 
according to an embodiment of the present invention; and 

[15] Figures 7 A and 7B illustrate methods according to embodiments of the 
present invention. 
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DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 



[16] In the following detailed description of preferred embodiments, reference is 
made to the accompanying drawings that show, by way of illustration, specific embodiments 
in which the invention can be practiced. In the drawings, like numerals describe 
substantially similar components throughout the several views. These embodiments are 
described in sufficient detail to enable those skilled in the art to practice the invention. 
Other embodiments may be utilized, and structural, logical, and intellectual changes may be 
made without departing from the scope of the present invention. Moreover, it is to be 
understood that various embodiments of the invention, although different, are not 
necessarily mutually exclusive. For example, a particular feature, structure, or characteristic 
described in one embodiment may be included within other embodiments. Accordingly, the 
following detailed description is not to be taken in a limiting sense, and the scope of the 
present invention is defined only by appended claims, along with the full scope of 
equivalence to which such claims are entitled. 

[17] In order to solve the min-delay problem discussed in the foregoing section, a 
solution was designed that does not use a complex deracing structure to resolve the min- 
delay problem and does not degrade the speed path. Since the select signal SI arrives before 
the data signal Dl, the deracer circuit can be used on SI and not affect the critical data path, 
as can be observed from Figures 4A and 4B. Particularly, preventing the select signal SI 
from being high when a precharge edge of data signal Dl arrives allows for reliable latch 
operation. This observation can serve as a basis for understanding the following description 
of embodiments of the present invention. 



[18] Referring to Figure 3A, a schematic diagram of a pseudo-dynamic latch in 
accordance with an embodiment of the present invention is illustrated that solves the 
previously described min-delay problem. The pseudo-dynamic latch circuit including deracer 
circuit 300A ensures that the select signal SI is not high when a precharge edge of data signal 
Dl arrives. Deracer circuit 300A comprises two gates, for example, inverter 302 and NOR 
gate 304, as illustrated in Figure 3A. However, those skilled in the art will appreciate that 
other logic configurations can be used. Reference will now be made to Figure 4A, which is a 
timing diagram that illustrates the solution rendered by the circuit of Figure 3A. 

[19] Referring to Figure 4A, initially, select signal SI will turn on and data signal 
Dl is precharged high, thus node PD will be pulled low at time 402. When a critical edge 
404 of data signal Dl arrives from domino driver 312, which is high going low, the PD node 
will be charged high. At this point, whether SI is high or low will not change the value of 
node PD, since the transistor 342 controlled by select signal SI is in series with the transistor 
344 controlled by data signal Dl, as can be seen with reference to Figure 3A. 

[20] Specifically, referring to Figure 3, n-type metal oxide semiconductor (NMOS) 
pull-down transistors 342 and 344 are in series with clocked NMOS transistor 360 and form 
pull down path 340. Likewise, NMOS pull-down transistors 352 and 354 are in series with 
clocked NMOS transistor 360 and form pull down path 350. Accordingly, for a pull down ' 
path to be active (i.e., pull down node PD) all transistors in the pull down path must be on. 
Accordingly, pull down path 340 is active when select signal SI driving transistor 342, data 
signal Dl driving transistor 344 and signal NCKB driving transistor 360 are all high. 
Likewise, pull down path 350 is active when select signal S2 driving transistor 352, data 



signal Dl driving transistor 354 and signal NCKB driving transistor 360 are all high. On the 
contrary, those skilled in the art will appreciate that pull up path 330 is active when the input 
signal (e.g., PGATE) is low since the pull up transistor is a p-type metal oxide 
semiconductor (PMOS). The foregoing description was provided solely for illustration and 
not for limitation. Those skilled in the art will appreciate that alternative arrangements of 
the circuit can include alternative arrangements of the pull up and pull down paths and of 
the p-type and n-type transistors along with corresponding logic changes. The foregoing 
description briefly describes some of the illustrated elements as a basis for further discussion 
of the following embodiments of the present invention and related timing diagrams. Since 
the min-delay problem discussed above occurs on the Dl portion of the circuit, only this 
portion of the circuit will be further discussed in detail. 

[21] Referring to both Figures 3 A and 4A, in the illustrated embodiment the 
deracer circuit 300A will pull the select signal SI to low at time 406, two gate delays after 
signal Dl goes low at time 404. The two gate delay is due to Dl being inverted by inverter 
302 and then the output Dl-bar being NORed with SI -bar by NOR gate 304, as illustrated 
in Figure 3A. Likewise, the select signal SI will not go high until time 410, which is two gate 
delays after data signal Dl is precharged to high at time 408. Accordingly, the min-delay is 
corrected as long as the select signal SI goes high after NCKB goes low, which is illustrated 
in Figure 4A. 

[22] Another embodiment of the present invention uses a similar structure as 
illustrated in Figure 3A (i.e., the deracer circuit 300A the dotted circle). However, instead of 
NORing the inverted logic select signal SI -bar, and inverted data signal Dl-bar, signal SI -bar 
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and NCK-bar can be NORed in deracer circuit 300B, as illustrated in Figure 3B. 
Specifically, NCK is inverted by inverter 302 to provide signal NCK-bar to NOR gate 304. 
NOR gate 304 generates select signal SI by NORing NCK-bar and Sl-bar, as illustrated. In 
this embodiment select signal SI will be deactivated two gate delays after NCK falls. This 
will be approximately the same as the fall of NCKB. The timing of this embodiment is 
discussed below, with reference to Figure 4B. 

[23] Figure 3C illustrates another embodiment of the present invention uses a 
clock signal CK that can be introduced to improve the timing of the embodiment of Figure 
3B. In this embodiment, the NOR gate 304 has input signals Sl-bar and clock CK, as 
illustrated in deracer circuit 300C. This deracer circuit 300C causes select signal SI to fall 
one gate delay after clock signal CK rises, which is the falling edge of NCK. However, this 
embodiment can consume extra power because of the extra gate and wire capacitance 
needed to route the clock signal CK, as will be appreciated by those skilled in the art. For 
example, one extra track in the layout is used to add clock signal CK. Additionally, although 
this embodiment gains one gate delay of margin over the embodiment of Figure 3B, the 
margin increases only if the gate delay decreases. There can be a limit to how much margin 
can be gained depending on how fast the NOR gate delay can be attained. Further, 
introducing one more clock signal CK will introduce more skew to the circuitry, which tends 
to make the circuitry less reliable. The timing of this embodiment is also discussed below, 
with reference to Figure 4B. 

[24] Figure 4B illustrates a timing diagram for the embodiments of Figures 3B and 
3C The clock signal CK is illustrated at the top of the timing diagram. The result of the 
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deracer circuit 300C with respect to select signal SI is illustrated at time 422. The result of 
the deracer circuit 300B with respect to select signal SI is illustrated at time 424. Since signal 
SI -bar is NORed with the clock signal CK in deracer circuit 300C, the select signal SI goes 
low at time 422 after the rising edge of clock signal CK. Likewise, SI goes low at time 424 
after the falling edge of inverted clock signal NCK, since the Sl-bar signal is NORed with 
the NCK-bar signal in deracer circuit 300B. Time 424 also corresponds to the precharge 
edge of the data signal 424. Both deracer circuits 300B and 300C result in the select signal 
SI being low at the precharge edge of data signal Dl. However, as noted above, using the 
clock signal CK gains a one gate delay margin (i.e., the difference between time 422 and 
424), as illustrated in Figure 4B. 

[25] However, the embodiment of Figure 3A does not add extra load to the clock, 
requires no additional track for new signals, and is very robust. Further, the margin can be 
increased by increasing the delay of inverter 302 that generates the Dl-bar signal that is 
inputted into NOR gate 304. Those skilled in the art will appreciate that increasing the delay 
is easier to achieve than decreasing the delay. Another advantage of increasing the delay of 
the inverter 302 is that it does not affect the speed of the rising edge of SI. The NOR gate 
304 causes the select signal SI rising edge to occur on the falling of Sl-bar and Dl-bar. 
Further, one can easily see from Figure 4A that data signal Dl rises during the NCKB low 
phase. Consequendy, there is plenty of time for Dl-bar to fall to low before Sl-bar falls to 
low (because Sl-bar falls to low during NCKB high phase). Therefore, the min-delay margin 
can increase without aggravating the max-delay margin. The max-delay margin for select 
signal SI is increased because it is generated by NOR gate 304, instead of using a simple 
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inverter. However, the increase in delay is minimal because SI arrives significantly earlier 
compared to the data signal Dl. 

[26] Accordingly, an embodiment of the present invention includes an apparatus 
comprising a pseudo-dynamic latch and a deracer circuit configured to prevent a select signal 
from being high when a precharge edge of a data signal arrives. An embodiment of the 
pseudo-dynamic latch and a deracer circuit is illustrated in Figure 3A. The deracer circuit 
300A includes a first logic gate 302 (e.g., an inverter) configured to receive a data signal Dl 
from a domino logic circuit and to invert the data signal Dl. A second logic gate 304 is also 
included and is configured to receive the inverted data signal Dl-bar and an inverted select 
signal Sl-bar and to generate a select signal SI. The deracer circuit 300A is configured to 
generate a state change of the select signal SI two gate delays after a state change of the data 
signal Dl, as illustrated in Figure 4A. Thus, the deracer circuit 300 is configured to prevent 
the select signal SI from going to a high state before the pseudo-dynamic latch goes into a 
hold state from an evaluate state (e.g., at time 410). 

[27] Additionally, an alternative embodiment of the present invention includes a 
deracer circuit comprising a first logic gate 302 and second logic gate 304 (e.g., as illustrated 
in Figure 3B). The first logic gate 302 is configured to receive an inverted clock signal NCK 
and to invert the inverted clock signal and generate NCK-bar. The second logic gate 304 is 
configured to receive the signal NCK-bar and an inverted select signal Sl-bar and to 
generate a select signal SI. Thus, the deracer circuit 300B is configured to prevent the select 
signal SI from going to a high state before the precharge edge of the data signal Dl arrives. 

[28] Further, another embodiment of the present invention includes a deracer 
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circuit 300C comprising a logic gate 304 (e.g., as illustrated in Figure 3C). The logic gate 304 
is coupled to a clock signal CK and an inverted select signal SI -bar. The logic gate 304 
generates a select signal SI. Thus, the logic gate 304 prevents the select signal SI from going 
to a high state before the precharge edge of the data signal Dl arrives. 

[29] Another solution is illustrated for the pseudo-dynamic latch circuit is 
illustrated in Figure 5. For purposes of illustration, only the pull down paths 510 and 520 of 
data signals Dl and D2 are illustrated. As can be seen from a comparison of Figure 1 and 
Figure 5, an additional NMOS transistor 560 has been added in the D2 pull down path. 
Additionally, the inverted clock signal NCK is not buffered in the Dl pull down path 510, 
but is buffered by buffer 522 in the D2 path 520. This allows for removal of min-delay 
buffer 110 and improved timing in the data signal Dl path. However, this improvement in 
the data signal Dl path comes at the cost of increased load and power consumption on the 
inverted clock signal NCK. Further, additional transistor 502 increases the area used to 
realize the circuit and increases power consumption. 

[30] Accordingly, embodiments of the present invention do not aggravate the max- 
delay margin while increasing the min-delay margin with robustness. Conventional min- 
delay deracers will involve a tradeoff between max-delay and min-delay margins. Those 
skilled in the art will appreciate that embodiments of the present invention are useful in any 
circuit design that uses pseudo-dynamic latches in the similar way. An example of a use of 
this pseudo-dynamic latch is a multiplexer (mux) latch in a microprocessor. 

[31] As can be appreciated from the foregoing description, embodiments of the 
present invention can be use in a wide variety of applications including computer systems. 
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Figure 6 shows an exemplary illustration of a computer system. The computer system can 
include a microprocessor 2, which can include many sub-blocks such as an arithmetic logic 
unit (ALU) 4 and an on-die cache 6. The microprocessor 2 may also communicate to other 
levels of cache, such as off-die cache 8. Higher memory hierarchy levels such as system 
memory 10 are accessed via host bus 12 and a chip set 14. In addition, other off-die 
functional units such as a graphics accelerator 16 and a network interface controller 18, to 
name just a few, may communicate with the microprocessor 2 via appropriate busses or 
ports. Embodiments of the present invention can include system components that use 
pseudo-dynamic latches, such as an ALU, multiplexer, microprocessor, memory circuits, and 
the like. 

[32] Accordingly, and embodiment of the present invention can include a system 
comprising: a microprocessor; and an off-die component in communication with the 
microprocessor. The microprocessor comprises a pseudo-dynamic latch having a deracer 
circuit as detailed in the foregoing description. The deracer circuit comprises a first logic 
gate and a second logic gate. The first logic gate is configured to receive a data signal from a 
domino logic circuit and to invert the data signal The second logic gate is configured to 
receive the inverted data signal and an inverted select signal and to generate a positive select 
signal. 

[33] Further, those skilled in the art will appreciate from the foregoing description 
methods in accordance with embodiments of the present invention. Accordingly, an 
embodiment of the present invention is illustrated in Figures 7A and 7B. The method 
comprises receiving a data signal Dl from a domino logic circuit in block 710 and 
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preventing a select signal SI from being high when a precharge edge of the data signal Dl 
arrives in block 720. 

[34] For example, as illustrated in Figure 3 A, the data signal Dl can be inverted 
with a first logic gate 302. The inverted data signal Dl-bar and an inverted select signal Sl- 
bar can be received by a second logic gate 304 and the second logic gate 304 can generate 
the select signal SI. Data signal Dl can be from a domino logic circuit input to the pseudo- 
dynamic latch. 

[35] Accordingly, as illustrated in Figure 7B, the method of block 720 can include 
inverting the data signal Dl with a first logic gate, in block 722 and receiving the inverted 
data signal Dl-bar and an inverted select signal SI -bar and generating the select signal SI 
with a second logic gate, in block 724. A state change of the select signal SI is generated two 
gate delays after a state change of the data signal Dl. Thus, the method prevents the select 
signal SI from going to a high state before the precharge edge of the data signal Dl arrives. 

[36] The foregoing embodiments and advantages are merely exemplary and are not 
to be construed as limiting the present invention. The present teaching can be readily 
applied to other types of apparatuses. The description of embodiments of the present 
invention is intended to be illustrative, and not to limit the scope of the claims. Many 
alternatives, modifications, and variations will be apparent to those skilled in the art. In the 
claims, means-plus- function clauses if used are intended to cover the structures described 
herein as performing the recited function and not only structural equivalents but also 
equivalent structures. 
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